DOCUMENT RESUME

ED 135 642                        95                        TM 006 069

AUTHOR          Rasp, Alfred, Jr.
TITLE           Using Anchor Test Study Tables in State Assessment
                Programs.
INSTITUTION     ERIC Clearinghouse on Tests, Measurement, and
                Evaluation, Princeton, N.J.
SPONS AGENCY    National Inst. of Education (DHEW), Washington, D.C.
REPORT NO       ERIC-TM-58
PUB DATE        Dec 76
CONTRACT        400-75-0015
NOTE            8p.; For the Anchor Test Study, see ED 092 601-634

EDRS PRICE      MF-$0.83 HC-$1.67 Plus Postage.
DESCRIPTORS     *Educational Assessment; Elementary Education;
                *Equated Scores; Grade 4; Grade 5; Grade 6; *Norms;
                Raw Scores; Reading Achievement; *Reading Tests;
                Standardized Tests; *State Programs; Test
                Interpretation
IDENTIFIERS     *Anchor Test Study



ABSTRACT

This paper focuses on three topics. The first
introduces the original Anchor Test Study conducted and reported by
Educational Testing Service (ETS) from 1971 to 1974. This study,
involving the testing of more than 300,000 children, produced raw
score equivalency tables for eight commonly used reading tests and
new individual and school mean norms tables for grades 4, 5, and 6.
The second part describes Washington State's 1973-74 use of the
Anchor Test Study tables to conduct a reading assessment based on a
statewide sample of sixth-grade students and 1974-75 efforts to
develop computer programs to facilitate greater practical application
of the original tables. The final section describes advantages shown
by the Washington experience and presents suggestions aimed at
maximizing the potential of the anchor approach to a state-level
assessment of reading achievement. (Author/RC)



Documents acquired by ERIC include many informal unpublished
materials not available from other sources. ERIC makes every effort
to obtain the best copy available. Nevertheless, items of marginal
reproducibility are often encountered and this affects the quality
of the microfiche and hardcopy reproductions ERIC makes available
via the ERIC Document Reproduction Service (EDRS). EDRS is not
responsible for the quality of the original document. Reproductions
supplied by EDRS are the best that can be made from the original.



ERIC CLEARINGHOUSE ON TESTS, MEASUREMENT, & EVALUATION

EDUCATIONAL TESTING SERVICE, PRINCETON, NEW JERSEY 08540

TM REPORT 58

DECEMBER 1976



USING ANCHOR TEST STUDY TABLES IN STATE ASSESSMENT PROGRAMS

Alfred Rasp Jr. 



ABSTRACT 






This paper focuses on three topics. The first introduces the reader to the original Anchor
Test Study conducted and reported by Educational Testing Service (ETS) from 1971 to
1974. This monumental study, involving the testing of more than 300,000 children,
produced raw score equivalency tables for eight commonly used reading tests and new
individual- and school-mean norms tables for grades 4, 5, and 6.

The second part describes Washington State's 1973-74 use of the Anchor Test Study
tables to conduct a reading assessment based on a statewide sample of sixth-grade
students and 1974-75 efforts to develop computer programs to facilitate greater practical
application of the original tables.

The final section describes advantages and disadvantages shown us by the Washington
experience and presents suggestions aimed at maximizing the potential of the
anchor approach to a state-level assessment of reading achievement.



THE ANCHOR TEST STUDY



The powerful notion of accountability in education is not
the direct focus of this paper, but it serves logically as the
starting point in a discussion of the development of the
Anchor Test Study and the use of its results. Talk about
educational accountability has been widespread for several
years. The most cursory survey of the literature or the
briefest of visits to a school's faculty room or to a local
school board meeting or to a legislative budget hearing will
confirm the continuing popularity of the concept. And
although not everyone using the term can agree on its meaning
or what is required to achieve it, two aspects are commonly
acknowledged. The first is the general concern for
accomplishment. While it may be true that in the past
educators concentrated their efforts on measuring and
accounting for inputs rather than results in terms of student
performance, today it is clear that both public and
professional expectations extend well beyond accounting
for inputs to an abiding interest in the achievement of
students.

The second commonly held idea grows from this concern
for results: More and more groups of private citizens and
elective bodies are mandating formal and public reporting
of the relative effectiveness of various local, state, and
federal educational programs.

This general demand for accountability and the special
interest in improving achievement and demonstrating
program effectiveness has led to the Anchor Test Study.
The specific motivating force was the desire to evaluate
the success of the Elementary and Secondary Education
Act (ESEA) Title I program. The disappointing results of a
1968 evaluation attempt demonstrated vividly to the U.S.
Office of Education the basic problems inherent in attempting
to aggregate reading achievement data gained from a
wide variety of tests lacking statistical comparability. In
1969, the feasibility of equating achievement tests in reading
was investigated, and in 1971, a contract was awarded
to Educational Testing Service (ETS) to carry out a study
using one test as an anchor point for equating and norming
other commonly used reading achievement tests. In April
of 1972 and 1973, data were collected on the eight tests
that ultimately formed the basis of the widely known
Anchor Test Study (ATS), published in final form as a
technical report consisting of 34 volumes and more than 15,000
pages (1).

In developing the anchor tables, ETS carried out two
operations: norming and equating. The norming phase
was accomplished by administering the reading subtests
of the Metropolitan Achievement Test to a total of more
than 200,000 children in grades 4, 5, and 6. In the equating
operation, about 150,000 children took pairs of the selected
reading tests. A total of more than 1,700 schools and
350,000 students participated in the study.

The material in this publication was prepared pursuant to a contract with the National Institute of Education, U.S. Department of
Health, Education and Welfare. Contractors undertaking such projects under Government sponsorship are encouraged to express freely
their judgment in professional and technical matters. Prior to publication, the manuscript was submitted to qualified professionals for
critical review and determination of professional competence. This publication has met such standards. Points of view or opinions,
however, do not necessarily represent the official view or opinions of either these reviewers or the National Institute of Education.

The resulting norms tables developed by ETS provide
transformations of the raw scores of the eight reading
achievement tests to a single table of national percentile
ranks and provide national, individual, and school mean
norms for grades 4, 5, and 6. A listing of the tests, editions,
forms, and levels included in the study is presented in
Table 1.



Suggested Uses 

The equivalency tables and the individual student and
school norms tables provide a versatile array of applications
in assessment and evaluation. A concise discussion
of alternative uses is found in the "Use of Tables" section
of the popular ATS report prepared by Loret, Seder,
Bianchini, and Vale (2, pp. 3-6). Two examples taken from
that discussion indicate the wide range of practical
applications.

First, a comparison of individual student performances
using scores from different tests:

Problem: it is desirable to compare the reading
achievement of three students. Peter, Alan, and
Chuck (all 5th graders) have each taken a different
reading test. Their Total Reading raw scores are:
Peter (49 on CTBS), Alan (44 on CAT), and Chuck (54
on MAT). To compare the Anchor Test Study national
percentile ranks for these three pupils, turn to table
26, page 73, to find the norms for Total Reading
score, grade 5. Find each pupil's score in the "Raw
score" column, then read across until you find the
appropriate entry under that test's name. Peter's 49
yields an Anchor Test Study national percentile rank
of 32 on the CTBS, Alan's 44 yields 18 on CAT, and
Chuck's 54 yields an Anchor Test Study national
percentile rank of 50 on MAT. These Anchor Test
Study national percentile ranks are now directly
comparable because they are derived from the same
norms sample.
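
The lookup described in this example can be sketched in a few lines of Python. This is only an illustration: the dictionary below holds the three table entries quoted in the example (with the test abbreviations as read from it), not the full ATS table 26, and the function name is hypothetical.

```python
# Illustrative excerpt only: the three grade 5 Total Reading entries quoted
# in the example. A real table would list every raw score for every test.
PERCENTILE_RANK = {
    ("CTBS", 49): 32,
    ("CAT", 44): 18,
    ("MAT", 54): 50,
}


def national_percentile(test, raw_score):
    """Return the ATS national percentile rank for a raw score on one test."""
    try:
        return PERCENTILE_RANK[(test, raw_score)]
    except KeyError:
        raise KeyError(f"no table entry for {test} raw score {raw_score}")


pupils = {"Peter": ("CTBS", 49), "Alan": ("CAT", 44), "Chuck": ("MAT", 54)}

# Convert each pupil's raw score to the common percentile scale.
ranks = {name: national_percentile(*entry) for name, entry in pupils.items()}

# Once on the same scale, the pupils can be ordered directly.
ranking = sorted(ranks, key=ranks.get, reverse=True)
```

Because every score is mapped onto one percentile scale, the ordering step needs no knowledge of which test each pupil took.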

Second, a comparison of the performance of two or more
schools with mean scores based on different tests:

Problem: to compare the vocabulary performance
of 6th grade pupils at Classical Elementary (mean
score on SAT, 29) and Lowell Elementary Schools
(mean score on CAT, 26). While the raw score school
means are available for both schools, they are based
on two different tests. Table 31, page 87, contains
the Anchor Test Study school mean norms for grade
6, Vocabulary. Locate the mean raw score (29) for
Classical Elementary School and find the corresponding
Anchor Test Study percentile rank and stanine
in the column entitled "SAT" (percentile rank of 72,



TABLE 1

                                                     Level Used at Grade:
Test (Edition)                         Form   4                5                 6

California Achievement
Tests (1970 ed.)                       A      3                3                 4

Comprehensive Tests of
Basic Skills (1968 ed.)                Q      2                2                 3

Gates-MacGinitie Reading
Tests (1964 ed.)                       1M     Survey D         Survey D          Survey D

Iowa Tests of Basic
Skills (1971 ed.)                      5      10               11                12

Metropolitan Achievement
Tests (1970 ed.)                       F      Elementary       Intermediate      Intermediate

Sequential Tests of Educational
Progress, STEP Series II (1969 ed.)    A      4                4                 4

SRA Achievement Series
(1971 ed.)                             E      Blue edition     Blue edition      Green edition

Stanford Achievement
Tests (1964 ed.)                       W      Intermediate I   Intermediate II   Intermediate II






stanine 6). Now enter the same table, by locating the
mean raw score (26) for Lowell Elementary School,
and read the Anchor Test Study percentile rank and


A STATEWIDE 



If it is possible to compare the achievement of individual
students or schools using the ATS norms tables, would it
not then be possible to use the same procedures on a larger
scale to develop an assessment of reading for an entire
state? This was essentially the question asked by the Program
Evaluation Section in the office of the Washington
State Superintendent of Public Instruction during the
summer of 1973 when the unofficial results of the ETS
efforts were first being discussed. The Washington
deliberations led to a positive course of action, and the
desire to develop a state reading profile through the
application of the ATS tables was incorporated into the State
ESEA Title III needs assessment plan for fiscal year 1974.
Support for this style of assessment rested on an interest
both in generating a description of the reading performance
of Washington pupils and in studying the feasibility of
using the ATS norms tables and local school district data as
the basis for constructing a state profile of reading
achievement.

When the Anchor Test Study User's Manual (unofficial
version not including the Gates-MacGinitie) was made
available to the Washington Superintendent of Public
Instruction in the fall of 1973, the plan was set in motion.
In an effort to generalize reading achievement to the state
as a whole and to categories arranged by size reflecting
school district enrollment, each common school containing
grades 4, 5, and 6 was assigned to one of 10 categories.
Twenty percent of the schools were drawn randomly from
each size category with an additional 10 percent sample of
schools drawn as alternates. A questionnaire was prepared
and sent to all school districts to collect information related
to the use of the tests included in the ATS tables. Because
the survey showed that more of the ATS tests were
administered at the sixth grade than at the fourth or the fifth,
grade 6 was selected for analysis, and the sampled schools
were checked to see where replacements would be required.
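
The sampling plan just described (a 20 percent random draw within each size category, plus a further 10 percent held as alternates) can be sketched as follows. This is a minimal illustration, not the procedure Washington actually ran: the function name, the seed, and the two made-up categories of school identifiers are all assumptions.

```python
import random


def draw_sample(schools_by_category, primary=0.20, alternate=0.10, seed=0):
    """Draw a primary sample and a disjoint alternate sample per category."""
    rng = random.Random(seed)  # fixed seed so the draw can be reproduced
    plan = {}
    for category, schools in schools_by_category.items():
        shuffled = list(schools)
        rng.shuffle(shuffled)
        n_primary = round(len(shuffled) * primary)
        n_alternate = round(len(shuffled) * alternate)
        plan[category] = {
            # The first slice is the sample; the next slice supplies
            # replacements for schools that lack usable test data.
            "primary": shuffled[:n_primary],
            "alternates": shuffled[n_primary:n_primary + n_alternate],
        }
    return plan


# Hypothetical sampling frame with two of the size categories.
frame = {
    "20,000 and over": [f"L{i}" for i in range(50)],
    "Under 300": [f"S{i}" for i in range(20)],
}
plan = draw_sample(frame)
```

Taking the alternates from the same shuffled list guarantees that no school appears in both the primary and the alternate sample.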

Requests for the raw scores of sixth-grade students were
sent to the selected schools, and as the resulting data were
tabulated, four circumstances became apparent: 1) several
districts did not complete the Anchor Test Profile Survey
accurately and did not possess information as claimed; 2)
the test results were submitted in a greater variety of
forms than was anticipated, especially in the way scores
were reported (for example, percentiles, stanines, grade
equivalencies, and growth scores were received in addition
to the raw scores requested); 3) the times of test
administrations covered every month from September to June; 4)
although 87 schools and 6,568 students were included,
insufficient appropriate data were available to maintain a
20 percent random sample in each of the 10 size categories
as a basis for generalization. The problems of data analysis



stanine in the column entitled "CAT" (percentile rank
of [illegible], stanine 7). These Anchor Test Study ranks
may now be compared.



were greatly increased because of the effort to maintain
some semblance of a random sample, and in many
instances, precision suffered as a consequence of dealing
with the lack of compatibility in test forms, levels, editions
and time of test administration.

Although the Washington study resulted in a somewhat
limited description of reading performance, it did produce
a profile of reading achievement and identified a number of
procedural problems which could be remedied in future
assessment programs. The results of the reading assessment
are displayed in Table 2, which shows an analysis of
sixth-grade reading scores using school norms. A more
complete discussion of the Washington experience is
presented in a technical report titled: Washington Statewide
Assessment Using Anchor Test Norms (4).

The Development of Computer Programs 

The outcome of the 1973-74 study was positive enough to
encourage the Washington evaluation staff to consider
further use of the ATS tables on the state level. In 1975, we
developed computer programs to facilitate the use of the
ATS tables for both state and local assessment purposes.
The Northwest Regional Educational Laboratory assisted
the state office in writing programs to provide score
transformations among the eight tests and conversions between
fall, winter, and spring norms. The resulting programs
accomplished three key purposes. First, the ATS equivalency
tables were programmed into the computer so that
test scores could be equated quickly. However, since the
original ATS tables reported only raw scores and spring
norms, they were of limited use for large-scale assessments
based on existing data. To provide greater flexibility, two
additional steps were taken. Tables were developed and
programmed to convert fall and winter testing times to
spring norms. The testing time conversions assumed linear
growth; for example, if a student was achieving at the 46th
percentile in the fall, a straight line projection (with score
increases spaced equally between intervals) was made to a
spring percentile of 46. (This assumption introduces the
possibility of error but is commonly used in large-scale
assessments and program evaluations.) Tables were also
programmed to convert the standard reporting options
(for example, grade equivalent scores, percentiles, and
scale scores) to raw scores.
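
One way a straight-line testing-time conversion of this kind could work is sketched below. It is an assumption-laden illustration, not the Washington program: the norms values are invented, only three percentiles are tabled, and the function names and the idea of expressing the testing date as a fraction of the spring-to-spring interval are hypothetical.

```python
# Invented spring norms: raw score at selected percentiles at the end of
# grade 5 and the end of grade 6. Real tables would cover every percentile.
SPRING_GRADE5 = {25: 30.0, 50: 40.0, 75: 50.0}
SPRING_GRADE6 = {25: 36.0, 50: 48.0, 75: 58.0}


def expected_score(percentile, fraction_of_year):
    """Raw score expected part-way between two springs, assuming linear growth."""
    start = SPRING_GRADE5[percentile]
    end = SPRING_GRADE6[percentile]
    return start + fraction_of_year * (end - start)


def fall_to_spring(raw_score, fraction_of_year):
    """Project a grade 6 score earned before spring to its spring equivalent.

    The pupil is assumed to hold the same percentile all year, so the
    percentile whose straight-line expected score is closest to the observed
    score is found first, and the spring score at that percentile is returned.
    """
    percentile = min(
        SPRING_GRADE6,
        key=lambda p: abs(expected_score(p, fraction_of_year) - raw_score),
    )
    return percentile, SPRING_GRADE6[percentile]


# A score of 44 earned halfway between the grade 5 and grade 6 spring norms.
projection = fall_to_spring(44, 0.5)
```

With these invented norms, a raw score of 44 at the halfway point sits on the 50th percentile line and projects to a spring raw score of 48.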

The practical utility of the original ATS accomplishments
is enhanced by the additional programs. The following is
taken from the Washington User's Guide to the Anchor
Test Program (3, p. 3) to illustrate their usefulness:

For example, School A may report grade equivalent
scores from Fall testing with the California Achievement
Tests, while School B reports raw scores for
the same time and test. School C may use Spring
percentiles from the Iowa Tests of Basic Skills, while
School D has Spring raw scores from the Stanford
Achievement Tests. By using the Anchor Test Program,
these schools can now communicate meaningfully
with each other about these test scores.

Efforts are now underway to make the Anchor Test Program
available to those Washington school districts and
other agencies of the common school district that have
computer installations.



TABLE 2
Washington Assessment
Grade Six Total Reading Scores

Estimated State and Size Category Means and Standard
Deviations for Six Standardized Tests (School Norms)

                                      Standardized Reading Tests
District Size       CTBS         ITBS         MAT          SAT          SRA          STEP II

20,000 and over     46.4         61.6         63.6         62.3         53.6         42.1
                     6.6          8.9          7.6          8.8          7.9          4.5

10,000-19,999       [illegible]  57.0         60.0         57.8         49.7         40.1
                     7.8         10.6          9.2         10.6          9.6          6.6

5,000-9,999         49.6         65.8         67.3         66.6         57.4         44.4
                     4.9          6.8          5.2          6.6          5.7          3.0

3,000-4,999         [illegible]  64.7         66.0         65.0         66.6         43.4
                     4.9          7.2          5.3          6.6          6.6          2.7

2,000-2,999         52.9         70.7         70.6         70.9         61.0         [illegible]
                     7.1         10.2          6.9          9.4         [illegible]   4.0

1,000-1,999         46.6         61.9         64.2         62.6         54.6         42.5
                     4.6          6.4          5.3          6.2          5.6          3.1

700-999             43.6         68.3         60.9         69.4         50.5         40.6
                     8.2         11.7          9.6         12.0         [illegible]   5.7

500-699             46.4         61.4         64.0         62.2         53.4         43.1
                     3.3          4.5          3.6          4.3          4.4          0.9

300-499             41.9         55.8         57.4         [illegible]  46.2         38.3
                    17.7         22.9         23.1         18.6         23.7         14.2

Under 300           50.2         67.1         67.6         66.6         58.4         44.9
                     3.4         11.4          9.8         10.7          9.9          6.8

State               47.0         62.4         64.1         62.4         56.0         42.6
(All Schools)        7.4         10.1          8.6          9.4          9.1          5.1

National ATS
Median Scores       46.8         62.0         64.8         63.0         54.2         43.0

Note: The first number represents the mean. Second number represents the standard deviation.
Although scores on CAT were not reported, CAT state and district means can be estimated from the
data using Educational Testing Service equivalency tables. CAT means for large districts to the
state respectively are approximated as follows: 44, 40.6, 46.6, 45.6, 60, 44, 41, 44, 39, 47, and 44.



ADVANTAGES AND DISADVANTAGES HIGHLIGHTED BY
THE WASHINGTON EXPERIENCE



The Washington experience has shown us the advantages
and disadvantages of using the ATS tables to conduct a
statewide reading assessment. Some of the problems faced
in Washington State are peculiar to that setting, but others
generalize to a broader range of situations. For example,
unless a state requires that local districts use tests in the
anchor study, you can anticipate a sampling problem. It is
highly improbable that the distribution, across known
relevant variables, of districts or schools using compatible
anchor tests would be wide enough to ensure that a random
draw would select only units with the desired test
information. Sampling was a major problem in Washington.
Even with an initial 20 percent sample in each size category
and an additional 10 percent replacement sample, the
schools in the final sample ranged from a low of 6.5 percent
in one category to a high of 11.8 percent in another. (See
Table 3.) This loss of original sample units limits, to an
unknown degree, the ability to generalize from the state
results. The state profile of reading is overly influenced by
those size categories with higher percentages unless the
results are weighted to more accurately reflect the
populations involved. Certainly in the district size categories
where the number and percent of sampled schools is small,
the stability of the achievement estimates must be
seriously questioned. The ability to generalize to the entire
population with confidence is directly affected by the
degree to which the sample lacks precision.
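
The effect of unequal sampling rates, and the weighting that corrects for it, can be shown with a small worked sketch. The two categories, their school counts, and their mean scores below are invented for illustration; the function names are hypothetical and the weights are simply the number of schools in each category's population.

```python
def pooled_sample_mean(categories):
    """Mean over sampled schools, ignoring how heavily each category was sampled."""
    sampled = sum(c["sampled"] for c in categories)
    return sum(c["sampled"] * c["mean"] for c in categories) / sampled


def weighted_state_mean(categories):
    """Mean with each category weighted by its share of all schools in the state."""
    population = sum(c["population"] for c in categories)
    return sum(c["population"] * c["mean"] for c in categories) / population


# Invented figures: both categories hold 100 schools, but one was sampled
# at 10 percent and the other at only 5 percent.
categories = [
    {"population": 100, "sampled": 10, "mean": 40},
    {"population": 100, "sampled": 5, "mean": 52},
]

unweighted = pooled_sample_mean(categories)
weighted = weighted_state_mean(categories)
```

In this sketch the pooled figure (44.0) leans toward the more heavily sampled category, while the weighted figure (46.0) gives the two equally sized categories equal influence.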

Obtaining an accurate description of available test data
at the local district or school level presents another problem.
Easy use of the ATS tables depends not only on the use
of an anchor test but on the use of the appropriate form
and level as well. In addition, an accurate record of
administration times and the available test-results reporting
options (raw scores, percentiles, stanines, and so on) is
crucial planning information. The logistics of data collection
also pose problems: not that districts fail to cooperate,
but that test data are frequently supplied in many "shapes
and sizes" and the clerical sorting task is monumental.
The computer programs developed by the Washington
State office, however, help to solve many of the processing
and analysis problems stemming from the wide array of
test results generated at testing times other than spring,
and reported in options other than raw scores.

There are other limitations to the use of the ATS in statewide
assessments. The tables limit the assessment to the
reading areas, total scores and subtests, and to three grade
levels. In addition, since the test items are already selected
and organized into standardized tests, there is no opportunity
to add or subtract items or the objectives they measure.
The achievement assessment is limited to what the
eight tests cover, and the items in these tests have been
used because they discriminate in a norm-referenced way,
not because of their relevance to program objectives.

A final limitation stems from the original parameters of
the Anchor Test Study itself. Eight test editions served as
the basis of the effort. Two of the tests, the CTBS and
Stanford, have already been revised, with new editions planned
for several others in the near future. Unless the current
tables are expanded or the test publishers themselves
provide precise bridges between editions (a rather unlikely
event), the current tables will soon be outdated and their
usefulness limited.



Efficiency a Major Advantage

In the face of these limitations, there is still a very potent
advantage inherent in the anchor test approach to the
state-level assessment of reading, and this is the efficiency
and low cost of this style of testing. The anchor tests were
selected for inclusion in the equating and norming procedure
because they are widely used achievement tests. It is
probable that in any state most schools administering
standardized achievement tests make use of one of the
popular anchor tests as part of the regular testing program.
To the extent that this is true, no new testing is required.
Local sampled schools need only send copies of scores to
the state office for tabulation. This means that the state
assessment program can build primarily on existing local
test data and that no specific test need be mandated by the
state agency or legislature. The resulting assessment program
presents a low profile, is unobtrusive, and requires
only a limited amount of staff time and relatively few dollars.
This basic advantage, while not responding to all of
the limitations, is extremely powerful in a time when
educational resources are becoming scarce and the demand
for public accounting widespread and influential.



Suggestions for State Level Assessment 

If the purpose of a state reading assessment is to produce
statements comparing the state-level performance or
achievement of students to national norms and/or to make
broad comparisons among selected educational groupings
within the state, the low cost and efficiency gained by
using the ATS tables are worth careful consideration. The
following suggestions point out some of the major steps
that can be taken to implement a reading assessment based
on the ATS tables that interferes only minimally in the
affairs of local schools and requires only limited resources.
To avoid peak load problems in staff time, approximately
18 months should be allowed for the process, with the starting
point in late winter or early spring. This seemingly long
period of time will prove beneficial to both the state office
and local districts.

Step 1. An accurate description of each district's standardized
testing program for grades 4, 5, and 6 is required.
Some states may have this on file, but in most cases, local
districts will need to be contacted to gain the necessary


TABLE 3
Washington Assessment

District and Sample Sizes Used in the
Anchor Test Study Data Collection Effort Based on
1972 School Census Data

                      Number of      Number of
Number of Pupils      Schools with   Schools in    Percent      Number of
in District           Grade 6        Sample        in Sample    Students

20,000 and over       226            23            10.2         1747
10,000-19,999         162            17            10.5         1469
5,000-9,999           142            15            10.6          951
3,000-4,999            78             6             7.7          656
2,000-2,999            48             5            10.4          764
1,000-1,999            59             5             8.5          252
700-999                34             4            11.8          229
500-699                28             3            10.7          149
300-499                42             3             7.1           86
Under 300              92             6             6.5           66

TOTALS                911            87                         6,568



information, and protocol in making the contacts is important.
The survey should collect at least the following
data by March:

- names and editions of tests
- forms and levels of tests
- grades and times of administration
- type of "results" available for students and schools
- an indication of anticipated changes for the next
  school year
- the name and phone number of the district test
  coordinator

Step 2. Since, given the assessment purpose mentioned
above, little is gained by testing every pupil at a selected
grade level, a sample should be designed that will provide
the generalizability and precision desired. If the analysis is
to focus on schools, schools should serve as the sample
unit, and data can be collected in the form of scores for all
students in the sample schools at the selected grade level
or in the form of school mean scores for the sampled
schools. If the primary interest is in comparing state student
achievement to national norms, there may be a special
interest in a two-stage sampling process that first selects
schools and then selects a sample of students within the
schools. This process is more time-consuming to implement
but requires the involvement of fewer schools and students
in establishing the state profile. Perhaps the easiest and
most straightforward process, whether the focus is on
schools or students, is to collect data from all children in
selected schools. In any event the sample should be drawn
by March.



Step 3. After the district survey information is analyzed
and the test coordinators contacted for necessary clarifications,
the sampled schools should be matched with the
survey results. This matching process will quickly determine
which of the sampled schools lack appropriate testing
programs. Since computer programs are available to convert
fall and winter data to spring norms (following
straight line projections which may add to the unreliability
of results) and to transform all standard results-reporting
options to raw scores, the crucial elements of the match
are correct test editions, forms, and levels.

Step 4. If the number of randomly selected schools without
compatible test data is too large (more than 40 to 60
percent), the efficiency advantage of the ATS model will be
lost. In this case, another assessment strategy should be
investigated. Assuming, however, that a solid majority of
schools fits the desired pattern, meetings should be held
with officials of the discrepant schools to plan a positive
course of action for the coming year. The contact needs to
be made in the spring (April or early May) so that adequate
implementation time is provided for changes or
"add ons" to local testing programs to incorporate one of
the ATS tests into the testing schedule. This step holds the
key to success and is more a human relations activity than
a technical one.
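
The matching and go/no-go check described in Steps 3 and 4 can be sketched as below. The sketch simplifies the match to the test name alone, whereas the text also requires the correct edition, form, and level; the set of test labels, the function names, the school identifiers, and the 40 percent default threshold (the lower end of the range quoted above) are all assumptions made for illustration.

```python
# Short labels for the eight anchor tests (labels are illustrative).
ATS_TESTS = {"CAT", "CTBS", "Gates-MacGinitie", "ITBS", "MAT", "STEP II", "SRA", "SAT"}


def incompatible_schools(sampled, survey):
    """Return sampled schools whose surveyed test is missing or not an ATS test."""
    return [school for school in sampled if survey.get(school) not in ATS_TESTS]


def efficiency_preserved(sampled, survey, threshold=0.40):
    """Report whether the share of incompatible schools stays within the threshold."""
    share = len(incompatible_schools(sampled, survey)) / len(sampled)
    return share <= threshold, share


# Hypothetical sample of five schools matched against survey responses.
sampled = ["A", "B", "C", "D", "E"]
survey = {"A": "CTBS", "B": "MAT", "C": "Local test", "D": "SAT"}  # E did not respond

discrepant = incompatible_schools(sampled, survey)
proceed, share = efficiency_preserved(sampled, survey)
```

The list of discrepant schools is the natural agenda for the spring meetings this step calls for.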

The solution may be unique to each district. In some
cases, it may only require a slight alteration in the district
or school testing program; in others, the loan of tests from
one district to another may be the answer. The state agency
may find it convenient to actually provide some of the tests
and scoring services. The fact that eight different tests can
be used greatly alleviates the problem. As a last resort, if
there is some evidence for supporting the assumption that
there is no systematic achievement bias between schools
using one of the anchor tests and those which do not, a
limited number of alternate schools can be used without
seriously affecting the representativeness of the sample.

Step 5. As soon as the final "go" decision is made, the
companies publishing the tests in the Anchor Test Study
tables should be contacted to provide the related technical
manuals. All other necessary materials should also be
ordered so that there will be no holdup during the processing
or analyzing phases.

Step 6. Early in the fall, all sampled schools should be
contacted directly with specific instructions regarding
data collection. This memorandum should build on the
previous year's survey response and present an exact
course of action, always stressing the importance of the
schools' contribution to the state assessment. Making the
"Hawthorne effect" explicit is an intricate part of the
strategy. Since the sampled schools will be using a variety
of tests, and testing at different times, the deadline for the
submission of data will vary but should be clearly established
for each group of schools using a similar pattern.

Step 7. The state office clerical staff should be trained to
review the content and quality of the data as they are received
and to monitor the due dates. The goal is to routinize
the data collection and processing as much as possible.
Most of the materials will be accumulating in November
and December after the fall testing, and in May and June
after the spring testing, so card punch and computer time
should be scheduled accordingly.

Step 8. Once the data are processed, the development of
the results tables, including means and standard deviations,
can take place. This is a technical job, but if the data
have been screened carefully as received, there should be
little problem. The predominate concern will be to prepare
a public report on the assessment that is clear and concise
and that does not generalize beyond the power of the data
or the rigor of the sampling. The issue of sampling and the
power to generalize is crucial in this time of full disclosure,
when both the media and the public demand access to
information regardless of its technical quality and frequently
use it in ways unintended or beyond intent.



A Final Statement 

The anchor test approach to reading assessment on a state
level is a workable one if one can tolerate its limitations:
limitations brought by the uneven distribution of ATS test
users, by well intended but inaccurate information, by the
focus on grades 4, 5, and 6, and by the technical and procedural
problems previously discussed. If the conditions
can be endured or overcome, this approach can produce a
reading achievement profile for a state and do it in a way
that is not disruptive in local schools or costly at the state
level.



REFERENCES 



1. Bianchini, J. & Loret, P. Anchor test study. Final report.
Berkeley: Educational Testing Service, 1974.
34 vols.

2. Loret, P., Seder, A., Bianchini, J. & Vale, C. Anchor
test study: Equivalence and norms tables for selected
reading achievement tests. Washington, D.C.: U.S.
Government Printing Office, 1974.

3. User's guide to the anchor test program. Olympia,
Washington: Superintendent of Public Instruction,
1975.

4. Washington statewide assessment using anchor test
norms, total reading, grade six, technical report.
Olympia, Washington: Superintendent of Public
Instruction, 1974.






